White shark optimizer with optimal deep learning based effective unmanned aerial vehicles communication and scene classification

Unmanned aerial vehicles (UAVs) become a promising enabler for the next generation of wireless networks with the tremendous growth in electronics and communications. The application of UAV communications comprises messages relying on coverage extension for transmission networks after disasters, Internet of Things (IoT) devices, and dispatching distress messages from the device positioned within the coverage hole to the emergency centre. But there are some problems in enhancing UAV clustering and scene classification using deep learning approaches for enhancing performance. This article presents a new White Shark Optimizer with Optimal Deep Learning based Effective Unmanned Aerial Vehicles Communication and Scene Classification (WSOODL-UAVCSC) technique. UAV clustering and scene categorization present many deep learning challenges in disaster management: scene understanding complexity, data variability and abundance, visual data feature extraction, nonlinear and high-dimensional data, adaptability and generalization, real-time decision making, UAV clustering optimization, sparse and incomplete data. the need to handle complex, high-dimensional data, adapt to changing environments, and make quick, correct decisions in critical situations drives deep learning in UAV clustering and scene categorization. The purpose of the WSOODL-UAVCSC technique is to cluster the UAVs for effective communication and scene classification. The WSO algorithm is utilized for the optimization of the UAV clustering process and enables to accomplish effective communication and interaction in the network. With dynamic adjustment of the clustering, the WSO algorithm improves the performance and robustness of the UAV system. For the scene classification process, the WSOODL-UAVCSC technique involves capsule network (CapsNet) feature extraction, marine predators algorithm (MPA) based hyperparameter tuning, and echo state network (ESN) classification. A wide-ranging simulation analysis was conducted to validate the enriched performance of the WSOODL-UAVCSC approach. Extensive result analysis pointed out the enhanced performance of the WSOODL-UAVCSC method over other existing techniques. The WSOODL-UAVCSC method achieved an accuracy of 99.12%, precision of 97.45%, recall of 98.90%, and F1-score of 98.10% when compared to other existing techniques.

risky fields and so on.Nevertheless, because of the inadequate energies and computational powers of individual UAVs, it is not possible for ensuring the optimum operational condition at all times 2 , whereas strong connection among the various UAVs to procedure a cluster that could be utilized to achieve different tasks in superior surroundings and complexity 3 .Thus, it progressively becomes a significant form of present applications of UAVs in combat.The UAVs node's superior mobility in FANET (Flying Ad Hoc Network) creates it better frequent for enter and exiting the networks, which can cause complications in the maintenance and establishment of the networks and build it challenging for controlling and managing the UAVs proficiently as their scale raised 4 .Separation of the networks into clusters can support solving the above complications.The separation process is based on dissimilar factors and UAVs are separated into various cluster groups that could be in direct communication with one another and share resources and mediums within the nodes' communication range 5 .
The great performance of the UAV is nominated as Cluster Head (CH) and the other UAVs in the groups are Cluster Members (CM) which can be based on various election considerations 6 .The CH nodes are accountable for inter-and intra-cluster information forwarded in the UAV networks, and then the nodes transmit packets to the CH, which transmits them to the BSs (Base Station) or nodes' destination 7 .Thereby, the control packet will be decreased.Nevertheless, the transmission load of CH can be raised due to it requires to transfer of information between management and also clusters CMs.Consequently, the separation of clusters and the collection of CHs, in addition to the effectiveness of cluster management schemes are crucial for achieving dependable communication and enhancing the network's performance in a hierarchical network.Once the aerial image scenes are obtained, it endures aerial image classification 8 .By the coverage of different earthed objects, the images are classified into subfields and several lands are covered with dissimilar semantic classes.Therefore, the classification of aerial images is a significant process for many real-time applications namely resource managing, metropolitan planning, RS and also computer cartography 1 .The deep Learning (DL) technique is extremely advantageous in the determination of traditional challenges namely Natural Language Processing (NLP), speech recognition, object detection and then a lot of these kinds of real-time applications.It is vastly more proficient than the standard processes and finally, it is also achieved with more consideration in industries and the scientific community 9 .
This article presents a new White Shark Optimizer with Optimal Deep Learning based Effective Unmanned Aerial Vehicles Communication and Scene Classification (WSOODL-UAVCSC) technique.The WSOODL-UAVCSC technique involves two main components: UAV clustering and scene classification 10 .The WSO algorithm is utilized for the optimization of the UAV clustering process enables to accomplish effective communication and interaction in the network.With dynamic adjustment of the clustering, the WSO algorithm improves the performance and robustness of the UAV system.For the scene classification process, the WSOODL-UAVCSC technique involves capsule network (CapsNet) feature extraction, marine predators algorithm (MPA) based hyperparameter tuning, and echo state network (ESN) classification.A wide-ranging simulation analysis was conducted to validate the enhanced performance of the WSOODL-UAVCSC method.
Unmanned Aerial Vehicles (UAVs) have experienced significant advancements in the fields of electronics and communications, rendering them a highly promising facilitator for the forthcoming era of wireless networks.Unmanned aerial vehicles (UAVs) have demonstrated their versatility and efficacy in a wide range of applications, encompassing intelligent systems such as communication and scene classification.Unmanned Aerial Vehicle (UAV) communication presents novel opportunities for entrepreneurs and innovators to investigate a diverse array of practical applications and transformative solutions 11 .The application of unmanned aerial vehicle (UAV) communications encompasses various scenarios, including the extension of coverage for transmission networks in the aftermath of disasters, facilitating communication for Internet of Things (IoT) devices, and enabling the transmission of distress messages from areas with limited coverage to emergency centers.Nevertheless, the task of improving the clustering of unmanned aerial vehicles (UAVs) and the classification of scenes using deep learning methods continues to pose a significant challenge, as the goal is to attain the highest level of performance.This article introduces a novel approach known as the White Shark Optimizer with Optimal Deep Learning-based Effective Unmanned Aerial Vehicles Communication and Scene Classification (WSOODL-UAVCSC) technique in response to the given context 12 .The main objective of the WSOODL-UAVCSC technique is to facilitate the clustering of Unmanned Aerial Vehicles (UAVs) in order to enhance communication efficiency and optimize scene classification.The WSOODL-UAVCSC technique comprises two primary constituents, namely UAV clustering and scene classification 13 .The WSO algorithm is utilized in the UAV clustering procedure to optimize the configuration of UAV clusters and improve communication and interaction within the network.
The performance and robustness of the UAV system are significantly enhanced by the WSO algorithm through the dynamic adjustment of clustering 14 .The scene classification process implemented by the WSOODL-UAVCSC technique involves multiple stages, namely Capsule Network (CapsNet) feature extraction, hyperparameter optimization using the marine predators algorithm (MPA), and classification utilizing the echo state network (ESN) 15 .The utilization of sophisticated deep learning methodologies significantly enhances the precision and effectiveness of scene classification, thereby enabling unmanned aerial vehicles (UAVs) to make well-informed decisions by leveraging the acquired data.The efficacy of the WSOODL-UAVCSC methodology is verified by means of an extensive simulation analysis.The comprehensive analysis of results demonstrates the superior performance of the WSOODL-UAVCSC method in comparison to existing techniques for clustering Unmanned Aerial Vehicles (UAVs) and classifying scenes 16 .The implementation of the WSOODL-UAVCSC technique has the potential to revolutionize wireless communication networks by leveraging UAVs 17 .This advancement allows for enhanced data transmission, improved scene comprehension, and the facilitation of various innovative applications.The results of this study present novel prospects for enhancing communication and scene classification using unmanned aerial vehicles (UAVs), thereby facilitating progress in the domain of intelligent systems and UAV technology.
The impetus behind the creation of the White Shark Optimizer with Optimal Deep Learning based Effective Unmanned Aerial Vehicles Communication and Scene Classification (WSOODL-UAVCSC) method arises from the increasing potential of Unmanned Aerial Vehicles (UAVs) within the realm of wireless networks and intelligent systems.Unmanned aerial vehicles (UAVs) have emerged as multifunctional instruments for a wide range of applications, encompassing communication and scene classification 18 .This development has created prospects for inventive and transformative solutions.The utilization of Unmanned Aerial Vehicle (UAV) communication presents notable benefits, including the expansion of transmission network coverage in the aftermath of disasters, the facilitation of communication for Internet of Things (IoT) devices, and the prompt dispatching of distress messages from areas lacking coverage to emergency centers.Nevertheless, there exist certain obstacles when it comes to improving the efficacy of UAV clustering and scene classification through the utilization of deep learning methodologies in order to attain the most optimal results 19 .
The WSOODL-UAVCSC technique has been developed to tackle these challenges through the introduction of a novel optimization approach that utilizes the White Shark Optimizer (WSO) for UAV clustering.The primary objective of the technique is to enhance performance and robustness within the network by effectively clustering UAVs, thereby improving communication and interaction 20 .The methodology comprises of two primary elements: Unmanned Aerial Vehicle (UAV) clustering and scene classification.The utilization of the WSO algorithm is employed to optimize the process of clustering Unmanned Aerial Vehicles (UAVs), with the aim of dynamically adjusting the clustering in order to enhance the overall performance of the system.Furthermore, the process of scene classification integrates sophisticated deep learning methodologies, including Capsule Network (CapsNet) for feature extraction, hyperparameter optimization through the marine predators algorithm (MPA), and classification utilizing the echo state network (ESN).The conducted simulation analysis serves to validate the performance of the WSOODL-UAVCSC approach, showcasing its enhanced capabilities in comparison to current techniques 21 .The integration of WSO optimization, feature extraction based on deep learning, and advanced classification techniques yields enhanced outcomes in tasks related to clustering and classification of UAVs and scenes.The primary objective of the WSOODL-UAVCSC technique is to leverage the capabilities of unmanned aerial vehicles (UAVs) in wireless networks and intelligent systems through the optimization of UAV clustering and scene classification procedures 22 .The proposed approach aims to enhance the performance and efficiency of unmanned aerial vehicle (UAV) communication applications, thereby creating opportunities for diverse real-world applications and novel solutions.
Due to advances in electronics and communications, UAVs may enable the next generation of wireless networks.Intelligent systems use UAVs for scene classification and communication.UAV communication enables coverage extension for transmission networks after disasters, Internet of Things (IoT) devices, and sending distress messages from devices in coverage holes to emergency centers.Using deep learning to improve UAV clustering and scene classification is difficult.The White Shark Optimizer with Optimal Deep Learning based Effective Unmanned Aerial Vehicles Communication and Scene Classification (WSOODL-UAVCSC) solves these issues 23 .The WSOODL-UAVCSC method clusters UAVs for communication and scene classification.It includes UAV clustering and scene categorization.The White Shark Optimizer (WSO) method optimizes UAV clustering for network efficiency.WSO dynamically adjusts clustering to improve UAV system performance and reliability.WSOODL-UAVCSC scene classification requires numerous phases.First, CapsNet extracts scene features.The marine predators algorithm (MPA) optimizes CapsNet performance by modifying hyperparameters.Finally, the echo state network (ESN) classifies scenes.A comprehensive simulation investigation validates the proposed approach.The analysis shows that WSOODL-UAVCSC outperforms other methods 24 .The research addresses UAV clustering and scene classification difficulties utilizing deep learning for effective communication and scene analysis.The WSOODL-UAVCSC algorithm improves UAV clustering and scene classification performance.

Outcomes of the proposed methodology
The WSOODL-UAVCSC disaster management UAV clustering and scene categorization method delivers numerous major results:

Organization of paper
The rest of the paper is structured in the following manner.Section "Related works" e presents a comprehensive examination of the relevant literature and the methodology utilized in this research endeavor.In Section "Proposed methodology", a comprehensive overview of the workflow utilized in the proposed study is provided, along with a detailed explanation of pertinent concepts.The fourth section of the paper is dedicated to the Simulation Setup and Parameters, Performance Metrics, and the comparative analysis of the results obtained.And, finally section "The shot vector is pushed towards and the Long vector is pushed towards by the squashing function."concludes the paper with future scope.

Related works
Pustokhina et al., 25 (2021) presented a new energy-effective cluster-based UAV with a DL-based scene classification (SC) approach.Primarily, the UAVs were clustered utilizing the T2FL approach because of RE, UAV degree, and distance to adjacent UAVs.Afterwards, the selected CHs transfer the captured images to BSs.Second, the DL method-based ResNet_50 system can be exploited for SC.For tuning the hyper-parameters of the ResNet_50 approach, a water wave optimizer (WWO) system can be employed.Finally, the KELM technique was utilized for performing the SC method.Rajagopal et al. 26 , (2020) presented a novel multi-objective PSO (MOPSO) approach for developing recent DCNNs (Deep Convolutional Neural Networks) in SC, which creates the non-dominant solution.This process assists to attain a tradeoff between the inference latency and classification performance, called multi objective convolutional neural network (MOCNN).Li et al., 27 (2018), discussed a new super pixel-based feature was presented in this case to distinguish UAV images.Based on the presented feature, a scene detection approach of the BoW method for aerial imaging was planned.The presented super-pixel-based feature which employs landform data introduces top-task super-pixel extraction of landforms to bottom-task expression of feature vectors.Guo et al., 28 (2021), presented an enhanced approach to deep reinforcement learning for unmanned aerial vehicle (UAV) navigation in environments characterized by high levels of dynamism.The proposed methodology demonstrates a higher level of convergence and effectiveness.
Uthayan et al., 29 (2022) presented a novel DL-enabled aerial SC approach for UAV-aided MEC methods.The projected method allows the UAVs for capturing aerial images that are transferred to MEC for more processing.A shuffled Shepherd Optimizer (SSO) system was carried out for accomplishing this and to define the hyper-parameters of the CapsNet approach.At last, the BPNN classification approach was executed to define the suitable classes of aerial imagery.Li and Zhou 30 (2021), the authors deal with scene detection by learning the representation of features automatically in big image instances.Primarily, the authors present a novel system for scene detection using trained a slight-weight CNN (Convolutional Neural Network) which completely takes minimal complex and better network structure and is trainable in the approach of end-to-end.Secondarily, the authors present to use of a salient region-based technique for extracting the local feature representation of certain scene areas directly in the convolutional layer dependent upon the self-selection process, and all the layers apply a linear function with an end-to-end approach.
Xia et al., 31 (2021), a novel lightweight method dependent upon VGG16 was presented for extracting various features of RSI by 5 convolutional elements.This method utilizes depthwise separable convolutional for reducing the network limitations.The pooling layer was added for solving the inherent non-adaptive issue of convolutional networks.The global average-pooling layer can be employed to sum the data for making an input spatial transformation further stable.
Ming et al., 32 (2021), for scene categorization in UAV remote sensing photos, the research suggested an unsupervised self-adaptive deep learning classification network.Both the Attention U-Net and the Mask RCNN performed well in classification when it came to describing finer details.Classification networks based on unsupervised adaptive learning are used both for classification and Sample retrieval strategy that automatically adjusts to homology and reliability.
Nilakshi and Bhogeswar 33 (2021), the study presented a novel methodology for feature selection in aerial scene classification, utilizing mutual information as the basis for efficient transfer learning.The presented study introduced an innovative approach for feature selection, utilizing mutual information as the primary criterion and enhanced transfer learning in the domain of aerial scene classification.
Yu et al., 34 (2021), presented on development of a guidance algorithm based on deep reinforcement learning, specifically designed for collision avoidance in fixed-wing unmanned aerial vehicles (UAVs).The research does not address aspects related to communication or scene classification.This paper introduced a computational guidance method for collision avoidance in limited airspace for multiple fixed-wing UAVs, utilizing deep reinforcement learning techniques.The algorithm under consideration demonstrated a high level of efficacy in mitigating the likelihood of collisions among multiple unmanned aerial vehicles (UAVs), even when the number of aircraft involved is substantial.The application of deep reinforcement learning in the context of collision avoidance.The presented study aims to explore an extension of the actor-critic model within the context of reinforcement learning.

Paper Methodology Contribution
Sarfraz, Ahmed, Dakhan 50 (2022) The suggested approach for ensemble learning, which utilises multiple objective particle swarm optimisation, demonstrates enhancements in subject-independent emotion identification based on EEG data The present study introduces a novel ensemble learning approach that demonstrates superior recognition performance compared to previous methodologies The research uncovered numerous cutting-edge methods, including unmanned aerial vehicles (UAVs), deep learning, scene classification, and reinforcement learning, among others.However, a significant technical void exists in the integration of multiple methods to comprehensively address complex real-world circumstances.Although a number of studies have focused on features such as energy-efficient clustering, scene classification, and collision avoidance, there has been surprisingly little research into comprehensive solutions that incorporate all of these elements.The lack of cohesive frameworks that integrate advanced approaches for tasks such as autonomous navigation, communication optimisation, and dynamic scene interpretation is one of the obstacles that must be surmounted in order to achieve efficient and adaptable UAV operations.In addition, standardised evaluation criteria and benchmark datasets are still required to facilitate the effective comparison and validation of proposed approaches, despite the progress made in certain fields.
To bridge this technical chasm, a concerted effort towards the development of integrated, multifaceted solutions that capitalise on the strengths of each approach is required.These solutions must efficiently manage the complexities of UAV applications in the actual world.A variety of innovative methodologies involving UAVs, deep learning, scene classification, and reinforcement learning emerged from the research survey.However, a significant technical void exists in the integration of these approaches to comprehensively address complex real-world scenarios.Despite the fact that a number of studies have focused on particular aspects such as energy-efficient aggregation, scene classification, and collision avoidance, there has been limited investigation into holistic solutions that combine these elements.The absence of cohesive frameworks integrating advanced techniques for tasks such as autonomous navigation, communication optimisation, and dynamic scene comprehension is a barrier to achieving seamless and adaptable UAV operations.In addition, despite the progress made in individual disciplines, there is a need for more standardised evaluation metrics and benchmark datasets to facilitate the comparison and validation of proposed methodologies.Closing this technical gap requires a concerted effort to develop integrated, multi-faceted solutions that leverage the assets of each approach to effectively address the complexities of UAV applications in the real world.

Proposed methodology
In this article, we have focused on the development of the WSOODL-UAVCSC for effective transmission and scene classification in the UAV network.The major aim of the WSOODL-UAVCSC technique is to cluster the UAVs for efficient communication and scene classification.The WSOODL-UAVCSC technique involves two main components: UAV clustering and scene classification.Figure 1 depicts the overall procedure of the WSOODL-UAVCSC method.The WSOODL-UAVCSC methodology is a comprehensive framework that has been developed to tackle the issues associated with communication and scene classification in Unmanned Aerial Vehicle (UAV) systems.This methodology takes a multi-faceted approach to address these challenges.The present methodology incorporates a range of sophisticated methodologies and algorithms in order to optimise the effectiveness of unmanned aerial vehicle (UAV) networks during disaster response situations.The WSOODL-UAVCSC framework encompasses a series of distinct stages: The methodology commences with the application of the White Shark Optimizer (WSO) algorithm, which facilitates the optimisation of Unmanned Aerial Vehicles (UAVs) clustering.The technique exhibits dynamic properties by adapting the clustering process to optimise communication and interaction within the network.The objective is to optimise performance and resilience, which are of utmost importance in situations of catastrophic events.
The WSOODL-UAVCSC framework utilises Capsule Networks (CapsNet) for the purpose of feature extraction.This is subsequently followed by the application of the Marine Predators Algorithm (MPA) to perform hyperparameter tuning.Finally, the Echo State Network (ESN) is employed for scene categorization.The objective of this multi-layered deep learning methodology is to effectively categorise situations that have been recorded by unmanned aerial vehicles (UAVs), which is a crucial component in the field of disaster management.

System model
Phase I: clustering process using the WSO algorithm The WSO algorithm is utilized for the optimization of the UAV clustering process and enables to accomplish effective communication and interaction in the network.With dynamic adjustment of the clustering, the WSO algorithm improves the performance and robustness of the UAV system.
The maximum speed of a UAV reaches up to 30m/s .All the UAV devices are based on the location-aware module which enables the routing technique to be an efficient and precise function.Generally, position data was obtained from the alternate system.In this work, GPS and inertial measurement units are provided for the deployment and motion sensing of UAVs.Every UAV is aware of its BSs and neighbours' location.All UAVs are equipped with short and long-range wireless transmissions.For intra-transmission, short-range wireless transmission is applied with the peers in the cluster.For inter-cluster transmission, long-range wireless transmission is applied with its BSs and other CHs.

Design of WSO algorithm
WSO is a metaheuristic optimization approach affected by the attributes of white sharks namely their sense of smell while foraging and navigating and their exceptional hearing 44 .The steps for the WSO algorithm are given as follows: Movement speed toward prey.Once a white shark identifies the prey position based on the waves generated by the activities of the target: In Eq. ( 1), the index i(i = 1, 2, . . ., n) formulates the white shark command in the population of size n, s sig- nifies the speed, p shows the current location vector of i th white sharks, P gbest shows the high strategic standing vector, P best indicates the present optimum location obtained so far, c 1 and c 2 are two random numbers between [0, 1], p 1 , p 2 , and u are evaluated by using Eqs.( 2), (3), and (4): The movement towards optimal prey: once they smell the fragrance of the target or see the prey movement or they presumably identify the waves caused by the prey movement, white sharks continuously travel towards the prey.The prey either leaves or escapes its position to find food.But still, there is the fragrance in that location.Consequently, the position was updated by the white shark: In Eq. ( 5), a and b represent a 1D binary vector,high and low denotes the upper and lower random search bounds, f refers to the frequency of the wave movement, and mv can be defined as follows: Let a 0 and a 1 be the two constant parameters.The movement towards the white shark: The formula for this phase is provided as follows: (1) where r 1 , r 2 , and r 3 represent the random value ranges within [0, 1] , and D shows the distance between the targets and the sharks.
Fish school behaviours: this phase was modelled by Eq. ( 9):

Process involved in clustering technique
The WSOODL-UAVCSC method measures a fitness function by adding various parameters.The WSOODL-UAVCSC technique is developed with the existence of four fitness parameters such as UAV nodes, average distance of UAVs for CHs enclosed by the sensing range, distance in CH to sink, and energy efficiency of cluster node density 45 .The data on fitness parameter was shown as follows: Energy efficiency: The CH performs diverse activities namely sense, gathered, aggregation, data broadcast, etc.; thus, when compared to other nodes, CH intakes a considerable amount of energy.Next, it is essential to determine an FF that shared the load amongst UAVs from the network: In Eq. ( 10), CH opt indicates the optimal percentage of CHs, R e , Aνg e , and n i indicate the node RE , the average energy of the network, and the overall amount of nodes in UAV, correspondingly.
Cluster node density: the cost is a key parameter for the higher energy efficacy of the network During intracluster transmission.As soon as the cost function of the cluster was defined, then the deployment of network energy becomes larger as follows: where n CH j indicates the quantity of UAVs from the range of CH j the CH j .The value of objective function f 2 is better than the effective selection of CH and exploited from the energy deduction.
The average distance of UAV to the CHs within the sensing range: In intra-cluster transmission, UAV transmits data to the CH.The energy of UAV reduces, once the CH is far away from the CM; there is a deployment of low energy afterwards the CHs is nearer to the member UAV nodes, In Eq. ( 12), n sr and dist (CH, i) show the amount of CH from the sensing sequence of the cluster and UAVs from the sensing range and Euclidean distance in nodes.Therefore, the value of f 3 is minimal; but, the intra- cluster transmission energy can be declined.
Distance from CH to BS: The distance between CHs and BSs takes a crucial function as if the CHS is distant from the sink and quickly exploits energy as follows: In Eq. ( 13), dist(BS, cH) shows the Euclidean distance between CH i and BS .Minimizing the f 4 objective func- tion displays that the CHs are not far from BSs.Once the f 1 , f 2 , f 3 , and f 4 parameter functions are calculated, then the objective function is called FF and evaluated by Eq. ( 14): where α, β, γ , and δ correspondingly indicate the weight coefficient for f 1 , f 2 , f 3 , and f 4 FF parameters, The weight coefficient ranges between [ 0, 1].

Architecture and working
Phase II: scene classification process For the scene classification process, the WSOODL-UAVCSC technique involves CapsNet feature extraction, marine predators algorithm (MPA) based hyperparameter tuning, and ESN classification.

CapsNet feature extraction
The CapsNet model is used for extracting features from the images.CapsNet (the capsule network) uses vector-wise" encoding, where items are encoded by capsules (collections of neurons).It assists to fix the location of ( 9) Vol.:(0123456789) objects and manage the relationship between them 46 .It resolves the problems of information loss caused by the pooling layer in CNN namely scale, location, size, and rotation.
A capsule is composed of a matrix or pose vector for encoding the object's instantiation of activation and different layers parameters.The instantiation parameter changes as the viewing circumstance change, however, the capsule remain active.With the capability of assigning parts to wholes, invariance, and equivariance are two qualities that are used to construct visual hierarchical connections.Figure 2 illustrates the infrastructure of CapsNet.
CapsNet simulates visual hierarchical relationships due to the "Dynamic routing" technique.In CapsNet, dynamic routing is used for establishing visual hierarchical relationships through the technique named "routing-by-agreement" to repeatedly route data transition from low to high-level capsules that is the central idea of dynamic routing in CapsNet.
Initially, the ReLU function is activated with 256 filters and takes the parameter of size 9×9 with a stride of 1.The feature was passed to the primary capsule through this function.CapsNet involves three different mechanisms: • Squash function, • Convolution, and The input is provided to the convolutional layer during the convolution process for generating a list of "feature maps".Here, this feature map was reshaped by the Reshaping function.At last, the entire vector's length is kept inside the range of 0 and 1 , based on the squash function.Because it signifies the probability that an item will be found at a particular place in the image and it does not cause the positional data contained in a high dimensional vector to be destroyed .
Consider that l and l + 1 layers have m and n capsules, correspondingly.The activation of the capsules at thel + 1 layer was computed based on the activation at the l layer.The letter u represents capsule activations at thel layer.We should evaluate v , the capsule activation, at thel + 1 layer .
For a j th capsule at l + 1 layers .
1.At the l layer, the capsule was used to evaluate the prediction vector.The prediction vector for j th capsule ( l + 1 layer) produced by i th capsules ( l layer) is: In Eq. ( 15), W ij is the weight matrix.2.Here is the output vector for thej th capsules that are evaluated.The output vector for thej th capsule is the sum of the weight of each prediction vector supplied by l layer capsules: 3. Scalar c ij signifies the coupling coefficient between capsules i ( l layer) and j ( l + 1 layer).The technique named iterative dynamic routing technique defines this coefficient.4. The squashing function is used to the output vector for obtaining v j activation of the j th capsule: 5.The shot vector is pushed towards 0 and the Long vector is pushed towards 1 by the squashing function.

Hyperparameter tuning
For adjusting the hyperparameters related to the CapsNet model, the MPA is used.MPA is a bio-inspired metaheuristic technique proposed to overcome complex optimization problems by using biological processes and natural events 47 .The foraging strategy of marine predators in the wild serves as a basis for the mathematical modelling of MPA.MPA accommodates the Brownian statistical and Lévy distributions.The Brownian technique makes the consistent and systematic progression through the search space, whereas The Lévy search method (15)   www.nature.com/scientificreports/includes traversing space with the sequence of prominent hops.The Brownian search process guarantees visit to remote places.This phenomenon has drastically improved the search abilities of MPA.
In the MPA method, the movement equation is the most important.It directs how the predator moves around the solution space.This can be formulated as follows: In Eq. ( 18), x i (t) shows the position of the i th predator at t time , v i (t) indicates the velocity of the i th predators at t time, and t shows the existing iteration of the model.The MPA's strength lies in its adaptability to multi-modal and fast convergence to optimum solutions and massively parallel optimization problems.The technique requires parameter tuning and might be stuck in the local optima.
The MPA method not only derives a fitness function to attain higher efficiency of classification and also describes a positive integer to represent the better outcome of the solution candidate.The decline of the classification error rate is considered a fitness function.

Image classification
Finally, the ESN model classifies the input images into distinct class labels.ESN comprises 3 layers such as output, reserve, and input layers.Since the weighted matrix of the input layer and internal connection matrix of the reserve pool (RP) can be arbitrarily created and set, the computational count of trained methods is decreased 48 .
The ESN resolves the fitting regression time sequence problems by exchanging the FC hidden state with spare connection RP; the upgrade layer of the network together with the resultant formula as: whereas tanh denotes the activation function and is utilized for obtaining the network echo features, a denotes the rate of leakage utilized for controlling the upgrade weighted of ESN network, W in stands for the matrix of input weighted arbitrarily created in the range of 1 and 1, R implies the connection matrix with sparse design inside the RP, u(t) defines the input at time t, x(t) stands for the t-moment layer of the RP, and y(t) indicates the outcome at time t .The resultant matrix W oui of the ESN is resolved using ridge regression with the subsequent optimizer objectives: whereas, stands for the regularized co-efficient utilized for preventing over-fitting in the ESN-trained set, and I represent the identity matrix.The forecast data can be replaced as Eqs.(20) and (21) to acquire the last forecast outcome.
The ESN design is easy and practical; but its forecast outcome was affected by parameter settings, like the RP connection matrix scaling parameter represented by R h , N denotes the count of RP network nodes, I S denotes the input data scaling co-efficient, S implies the RP sparsity degree, and a refers to the leakage value.Employing suitable parameter settings efficiently improves the forecast ability of the ESN.

Accuracy
The accuracy of a classification model is determined by calculating the proportion of correctly predicted instances, which includes both true positives and true negatives, relative to the total number of instances present in the dataset.From a mathematical standpoint, it can be formulated as follows:

Precision
Precision is a metric that serves as an indicator of the performance of a machine learning model.It specifically measures the quality of positive predictions made by the model.Precision is a metric that quantifies the proportion of accurate positive predictions in relation to the total number of positive predictions.It is calculated by dividing the number of true positives by the sum of true positives and false positives.

Recall
The recall metric is determined by dividing the number of correctly classified Positive samples by the total number of Positive samples.The recall metric quantifies the model's capacity to accurately identify positive samples.
There is a positive correlation between recall and the number of positive samples detected.

F1-score
The F1 score can be defined as the harmonic mean of precision and recall, thereby offering a well-balanced evaluation of the model's efficacy by incorporating both metrics.Precision is a metric that quantifies the ratio of accurately predicted positive instances (true positives) to the total number of positive predictions made by the model.In contrast, recall quantifies the ratio of correctly identified positive predictions to the total number of positive instances present in the dataset.

Result analysis
In this section, the clustering and scene classification outcomes of the WSOODL-UAVCSC technique are examined.The scene classification results of the WSOODL-UAVCSC technique are tested on the UCM dataset 49 .This is a 21-class land use image dataset with 100 images of each class.Each image measures 256 × 256 pixels.values.Although the T2FL model illustrates considerable performance, the WSOODL-UAVCSC technique demonstrates superior results with the least values of ECOM.Table 2 and Fig. 4 show the end-to-end delay (ETED) effects of the WSOODL-UAVCSC approach with present systems.The outcomes exposed that the TIFL method demonstrates worse results with maximal ETED values.Simultaneously, the KHA and MPSO methods acquired moderately increased performance with enough ETED values.Though the T2FL system demonstrates significant performance, the WSOODL-UAVCSC method exhibits greater outcomes with minimum values of ETED.
Figure 6 shows the training accuracy TR_accu y and VL_accu y of the WSOODL-UAVCSC approach.The TL_accu y is described by the estimation of the WSOODL-UAVCSC system on the TR database however the VL_accu y is computed by calculating the performance on an individual testing database.The outcomes dem- onstrated that TR_accu y and VL_accu y raising with an upsurge in epochs.Accordingly, the performance of the WSOODL-UAVCSC systems acquires to enhance the TR and TS database with an increase in many epochs.
In Fig. 7, the TR_loss and VR_loss effects of the WSOODL-UAVCSC method are exposed.The TR_loss deter- mined the error between the predicted performance and original values on the TR dataset.The VR_loss signify the estimation of the performance of the WSOODL-UAVCSC approach on a separate validation dataset.The outcomes denoted that the TR_loss and VR_loss tend to reduce with increasing epochs.It depicted the greater performance of the WSOODL-UAVCSC system and its proficiency to produce an accurate classification.The diminished value of TR_loss and VR_loss exhibits the improved performance of the WSOODL-UAVCSC proce- dure on capturing patterns and relationships.
A short precision-recall (PR) analysis of the WSOODL-UAVCSC system is established on the test database in Fig. 8.The outcomes stated that the WSOODL-UAVCSC system outcomes in maximum values of PR.Furthermore, it is perceptible that the WSOODL-UAVCSC approach can achieve greater PR values on all class labels.In Fig. 9, a ROC investigation of the WSOODL-UAVCSC model is shown on the test dataset.The figure defined that the WSOODL-UAVCSC method resulted in the enhancement of ROC values.Additionally, the WSOODL-UAVCSC system can increase ROC values on all class labels.Table 4 and Fig. 10 inspect the scene classification results of the WSOODL-UAVCSC technique with other recent models 10 .The experimental values highlighted that the VGGNet, VGG-RBFNN, CA-VGG-LSTM, Goog-leNet, and CA-GoogleNet-LSTM models have obtained poor performance over other models.Simultaneously, the C-PTRN method has shown slightly improved results with accu y , prec n , reca l , and F score of 98.67%, 91.65%, 97.45%, and 93.26% respectively.However, the WSOODL-UAVCSC technique gains maximum performance with accu y , prec n , reca l , and F score of 99.12%, 97.45%, 98.90%, and 98.10% correspondingly.
The CT results of the WSOODL-UAVCSC technique are compared with recent models in Table 5 and Fig.

Conclusion
This paper emphasises on the advancement of the WSOODL-UAVCSC system, aiming to enhance transmission efficiency and scene classification within the UAV network.The primary objective of the WSOODL-UAVCSC technique is to effectively cluster UAVs in order to optimise transmission and enhance scene classification.The WSOODL-UAVCSC approach comprises two primary constituents, namely UAV clustering and scene classification.The utilisation of the WSO algorithm in the optimisation of the UAV clustering process facilitates the achievement of efficient communication and interaction within the network.The performance and robustness of the UAV system are enhanced through the utilisation of the WSO method, which incorporates dynamic modification of clustering.The picture classification process incorporates the WSOODL-UAVCSC technique, which encompasses CapsNet feature extraction and classification using ESN.A comprehensive simulation analysis was conducted to verify the superior performance of the WSOODL-UAVCSC approach.The comprehensive analysis of the results revealed that the WSOODL-UAVCSC method exhibited superior performance compared to other current approaches.The suggested model has a possible drawback in its susceptibility to variations in hyperparameter configurations, a concern particularly relevant to deep learning architectures such as CapsNet and ESN.Achieving optimal hyperparameter tuning often requires thorough experimentation and dependence on domain-specific expertise.The validation of the method's effectiveness in real-world UAV applications should   be undertaken through the implementation of field testing and trials in future research endeavours.The execution of trials in practical settings including UAV communication and scene classification situations will provide significant knowledge and feedback, hence helping subsequent improvements.
In the future, enhancing the interpretability and explainability of deep learning models utilised for scene categorization and navigation could potentially foster greater trust and acceptance of these methodologies in safety-critical applications.Consequently, this may result in a heightened adoption of these techniques.The examination of novel methodologies for visualising decision-making processes inside these models has the potential to yield UAV systems that exhibit increased transparency and accountability.
. The results indicate that the VGGNet, VGG-RBFNN, CA-VGG-LSTM, GoogleNet, and CA-GoogleNet-LSTM have offered maximum CT values.Next, the C-PTRN model exhibits considerable outcomes with a CT of 1.72s.Nevertheless, the WSOODL-UAVCSC technique offers superior results with the least CT of 0.87s.These results show the betterment of the WSOODL-UAVCSC technique over other models.

Table 1 and
Fig. 3 exhibits the energy consumption (ECOM) outcomes of the WSOODL-UAVCSC technique with present techniques.The results show that the TIFL model shows worse outcomes with maximum ECOM values.At the same time, the KHA and MPSO models obtain slightly boosted performance with moderate ECOM

Table 1 .
ECOM outcome of WSOODL-UAVCSC system with other methods on varying rounds.

Table 2 .
ETED outcome of WSOODL-UAVCSC system with other methods on varying rounds.

Table 3 .
TRHT outcome of WSOODL-UAVCSC system with other methods on varying rounds.

Table 4 .
Comparative outcome of WSOODL-UAVCSC system with other methods.

Table 5 .
CT outcome of WSOODL-UAVCSC system with other methods.